Model Selection

Efficient quantized inference

# Efficient quantized inference

Mistral Small 3.1 24B Instruct 2503 Q4 K M GGUF

This is a GGUF format model converted from mistralai/Mistral-Small-3.1-24B-Instruct-2503, supporting multilingual text generation tasks.

Large Language Model Supports Multiple Languages

LGAI EXAONE EXAONE Deep 2.4B GGUF

This is the quantized version of LGAI-EXAONE's EXAONE-Deep-2.4B model, quantized using llama.cpp, supporting English and Korean text generation tasks.

Large Language Model Supports Multiple Languages

T5 3b Q4 K M GGUF

This model is a quantized version converted from google-t5/t5-3b to GGUF format using llama.cpp via ggml.ai's GGUF-my-repo space.

Machine Translation Supports Multiple Languages

Finance LLM GGUF

Finance LLM is a language model specialized in the financial domain, based on the Llama architecture, fine-tuned with datasets such as OpenOrca, Lima, and WizardLM.

Large Language Model English

Flan T5 Xxl Sharded Fp16

FLAN-T5 XXL is a variant of Google's T5 model, fine-tuned on over 1,000 additional tasks, supports multiple languages, and outperforms the original T5 model.

Large Language Model

Featured Recommended AI Models

AIbase

Empowering the Future, Your AI Solution Knowledge Base

English 简体中文繁體中文にほんご

© 2025AIbase